PRESCRIPTIVE KANJI SIMPLIFICATION 


by 


KEN ROGER LUNDE 


A thesis submitted in partial fulfillment of the 


requirements for the degree of 


Doctor of Philosophy 


(Linguistics) 


at the 
UNIVERSITY OF WISCONSIN-MADISON 
1994 


© Copyright by Ken Roger Lunde 1994 


All Rights Reserved 


PRESCRIPTIVE KANJI SIMPLIFICATION 


Ken Roger Lunde 
Under the supervision of Professor Andrew Sihler 


at the University of Wisconsin-Madison 


Abstract 


The simplification of Chinese characters (kanji) is a recognized historical process. 
Normally such a simplification grows spontaneously out of the demands of everyday 
use, but simplification by fiat is at least a theoretical possibility. Such a round of 
simplification by fiat, here called ‘prescriptive simplification,’ seems to have taken 
place in the PRC in 1977. What has gone unrecognized is that simplification by fiat 
was also attempted in Japan in 1983, albeit on a much smaller scale, and emanating 
from the government-controlled organ called the Japanese Standards Association. 
In this dissertation I marshal evidence to show that prescriptive kanji simplification 


has occurred in Japan, and discuss its consequences. 
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Chapter 1 Introduction 


This dissertation demonstrates, through evidence and testimony from both 
Japan and China, that a phenomenon called prescriptive kanji simplification exists, 
and also explains how it contrasts with spontaneous and descriptive 
simplification—the current literature dealing with kanji simplification make no 
such claim. In addition, I attempt to show that there is a varying level of public 


acceptance of prescriptive kanji simplifications.’ 


This dissertation does not provide exhaustive lists of simplified kanji, but 


examples are provided when necessary to illustrate concepts. 


"The term kanji refers to the characters that the Japanese borrowed from China—called hanzi in 
Chinese and hanja in Korean. In this dissertation, the term kanji is used generically to refer to 
Chinese characters used in Japan, China, and Korea. 


Literature Review 


Many papers, articles, and books have classified kanji simplifications into 
several types or principles; some provide exhaustive lists of simplified kanji allographs; 


and some even undertake to explain why kanji are simplified. 


Several reference works, including many kanji dictionaries, most of which 
are written in Japanese, were critical in preparing this dissertation. However, none 
of the references that describe kanji simplification make a claim that prescriptive 


kanji simplification exists. 


The 12-volume series entitled Kanji Koza GRFitt AB ‘Kanji Course’), Sato et 
al. (1987-1989), provides detailed information on the development of kanji in 
Japan, including detailed information on simplifications. However, at no time are 


there discussions about prescriptive simplification, as treated in this dissertation. 


Hayawakari Chiigoku Kantaipi 2d 0 PEA ‘A Quick Understanding 
of Chinese Simplified Characters’), Endo (1986), provides details about kanji 


simplifications in PRC (Mainland China), and draws comparisons with those in 


Japan. This book also fails to bring up the notion of prescriptive simplification as a 


way to explain some aspects of kanji simplification in PRC. Endo (1986), however, 
provides evidence for my claim that prescriptive kanji simplification has recently 


been attempted in PRC. 


Another important reference is a short paper entitled JIS C 6226 Joho 
Kokan-y6 Kanji Fugokei no Kaisei (IS C 62261R#R ACR ALES AO WUIE 
‘Revisions to JIS C 6226 Code of the Japanese Graphic Character Set for Information 
Interchange’). This article provides evidence which shows that simplification by 
prescription recently took place in Japan. This article is referred to as Nomura 


(1984) in this dissertation. 


Other references include various authoritative kanji reference works, such 
as dictionaries, and a collection of various Japanese character set standards. The 
character set standards are JIS X 0208-1990 (and its 1978 and 1983 versions), JIS X 


0212-1990, Joyo Kanji, and Jinmei-y6 Kanji. 


Conventions 


Several acronyms are used throughout this dissertation, and are described 


as follows: 


jis Short for Japanese Industrial Standard (AA T3kRis Nibon Kogy6 
Kikaku), the name given to the documents that are published by JSA 


(described next). 


JSA Short for Japanese Standards Association (HAR Ge Nibon Kikaku 
Kyokai), the organization of the Japanese government that oversees 


standards practices, and publishes JIS documents. 


PRC Short for the People’s Republic of China, meaning Mainland China. 


ROC Short for the Republic of China, meaning Taiwan. 


I denote long vowels with a macron when providing the readings for 


Japanese words. Chinese readings adhere to the Pinyin system of romanization. 


KUTEN (E.R ‘row [and] cell’) codes supplement some kanji lists. Providing 
these codes makes it easier to locate the actual characters within a Japanese 


character set standard. 


Chapter 2 The Japanese Writing System 


The Japanese writing system is composed of four scripts: Roman characters, 
hiragana, katakana, and kanji. Hiragana and katakana are collectively referred to as 


kana. 


Roman Characters 


Roman characters (U—Y‘f r6maji) as used in Japan are mostly found in 
acronyms and numerals. The set consists of the lowercase and uppercase letters of 


the English alphabet plus the numerals zero through nine. 


Kana 


Kana Uk&4 kana) are syllabaries, and their shapes were derived from 
kanji. A syllabic sign stands for a vowel, of which there are five in Japanese, either 


alone or preceded by a consonant. These consonants, whose traditional order is 


based on Sanskrit, are k, s, ft, n, b, m, y, r, and w. Additional phonemes are 
indicated by means of diacritics or special character combinations. There are two 


types of kana: hiragana and katakana. 


Hiragana 


Hiragana CEAR% hiragana) are one of the kana sets, and were derived by 
cursively writing kanji, but are used only for the phonetic value of their readings. 
Hiragana are generally used to write native Japanese words, such as grammatical 
words, inflectional endings for verbs and adjectives, and some nouns. Table 2-1 


illustrates the basic set of hiragana characters, conforming to modern usage.’ 


Table 2-1: The Hiragana Syllabary 


H 
ze 
O 
S 
a 


K 
yay 
a 
< 
(Tt 
sa (Zt 


Omace>,p 

oF Ur TS Bt 

Neh OUD 
rA UU ores 
Se NF Z 
FSP WE 
~ S&S Ae 
NA WO UA 
yw OYs 


Tables 2.1 and Table 2.2 (on page 7) are missing the character that represents the syllable-final 
nasal consonant. This character is /v for hiragana, and ~ for katakana. 


The horizontal axis of Table 2-1 (and of Table 2-2, which appears in the following 


section) represents the first part of the syllable, and its vertical axis represents the 


second part. For example, 4 is read a, and 2 is read ka. 


Katakana 


Katakana (Fr{2% katakana) are generally used in two ways: to write 
words of foreign origin, called gairaigo O}Kit gairaigo), and for emphasis—similar 
to the use of italics to represent foreign words and to express emphasis in English. 
For example, the Japanese word meaning ‘bread’ is written /\-” and read pan. 
This word was borrowed from the Portuguese word pdo. Katakana are also used 


to write foreign names. Table 2-2 illustrates the basic set of katakana characters, 


conforming to modern usage. 


Table 2-2: The Katakana Syllabary 


OMCs > 
MHA 
UN tan 
SL ANnvED 
7 SS BN SS 
See [TS 
ar ET Se 
HePwAZ 
HW aK 
OVSTen 
UH HU 


LL 


Kanj 


The word kanji GF kanji) literally means ‘Chinese character.” These 


characters were borrowed from the Chinese over 1,500 years ago. Kanji were not 


usually borrowed as individual characters, but as compounds containing two or 


more kanji. Table 2-3 provides some examples of kanji and kanji compounds. 


Table 2-3: Kanji and Kanji Compounds 


Kanji Gloss Compound 
a person 

ES medicine BR 

Be tooth pa Se 

A eye Aa 

FI self 

i) move Fay 

Hl vehicle A ayaa 


The Structure of Kanji 


Gloss 


doctor 
dentist 
optometrist 


automatic 
automobile 


Analogy 


‘medicine person’ 
‘tooth doctor’ 
‘eye doctor’ 


‘self-movement’ 
‘automatic vehicle’ 


Kanji are organized by a system of 214 radicals, which generally represent 


words, but are important for their systematic combination—in compressed or 


otherwise reduced form—with phonetic characters. The compound characters thus 


*In PRC these characters are called hanzi (1%), and in Korea they are called hanja GAP). 


9 
formed make up the majority of kanji in use today. Table 2-4 provides examples of 


radicals, their variant forms, their glosses, and kanji composed with them.‘ 


Table 2-4: Kanji and Their Radicals 


Radical Variant(s) Gloss Kanji Formed From Radical 

xi 6 earth ih (ground), 9x (slope), 5% (cultivate), Hk (ditch) 
7K tree It (timber), #K (woods), HE (root), # (forest) 

7K Y,-K water iF (sweat), 7 (steam), 5X (fountain), #% (wave) 
*K an fire PR (ash), F furnace), f (smoke), #4 (heat) 

oa insect i (mosquito), E (snake), 8 Cfirefly), ' Cant) 
Xe ee running  3& (send), i8 (chase), i (advance), it (exceed) 
a metal $8 (silver), Hil (copper), BH (gun), Hi (steel) 


The concept of arranging kanji by these 214 radicals was introduced by 
Zhang et al. (1716), which is a classic and authoritative kanji reference work. 


Contemporary kanji reference works, in general, adhere to this system. 


The Pronunciation of Kanji in Japanese 


The typical kanji has at least two readings—some have more. For example, 
the kanji 4 Clife?) has 27 readings, most of which are used for Japanese given 


names, which typically have unusual readings. 


“The radical variants that are shown in Table 2-4 do not include squeezed or stretched variations 
of the base radical’s shape. 


10 


Kanji readings come from two sources: a native Japanese reading and a 


borrowed (and approximated) Chinese reading. The native Japanese reading is 


called the KUN reading (Wll%@4 Run-yomi). The borrowed Chinese reading is the 


Japanese-language approximation of the native Chinese reading of a kanji. These 


borrowed approximate readings are called ON readings (Hit on-yomi). Some 


kanji have one or more ON readings, but no KUN readings; some have one or 


more KUN readings, but no ON readings.’ Table 2-5 provides several examples 


of kanji with a variety of readings. 


Table 2-5: Kanji and Their Readings 


Kanji 
iG 
AE 


BA 
FIN 


> 


Gloss 
vehicle 


shop 
life 


probibit 
gun 
come in 


frame 


ON Reading(s) 
sha 

ten 

sei, sho 


kin 


KUN Reading(s) 

kuruma 

mise 

ari, bu, fu, fuyu, haeru, hayasu, i, ikasu, 
ikeru, ikiru, ki, mi, nama, nari, nori, 0, 
oki, ou, susumu, taka, ubu, umareru, umu, 


yo 


komi, komu, komeru 


waku 


°Kanji with no ON readings generally fall into the category of Japanese-made kanji, called kokuji. 
Japanese-made kanji are addressed at the end of this chapter. 
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The Borrowing of Kanji From China 


The Japanese began borrowing the Chinese script over 1,500 years ago. 
This massive borrowing took place in three different waves. Several kanji were 
borrowed repeatedly at different periods, and the reading of each kanji was also 
borrowed again. This led to different readings for a given kanji depending on 
which word or words it appeared in, due to dialectal and diachronic differences in 


China over the period of the three borrowings. 


During all three waves of borrowing, most kanji were borrowed as compounds 
of two or more kanji, rather than as isolated characters. It is in this context that 
you find differences in reading of a particular kanji depending on what word it 
appears in. For example, the kanji 4, meaning ‘ten-thousand,’ can be found in 
kanji compounds with either the reading man or ban, such as Ji— (man’ichi ‘by 
chance’—literally, ‘ten-thousand to one’) and Ais (banzai ‘ten-thousand years 
old’). This man/ban alternation provides evidence that these two words were 


borrowed at different periods. 


The first two waves of borrowing had the most significant impact on the 


Japanese lexicon. Their sources were two different dialects of Chinese, which 


Lz 
accounts for the dual ON readings of many kanji. The third wave of borrowing 


had very little effect on the Japanese lexicon. 


Japanese-made Kanji 


The Japanese also created their own kanji, called kokuji (EI ‘Japanese-made 
kanji’).° Kokuji behave like true kanji, following the same rules of structure—namely 
that they are composed of radicals and strokes, and can be combined with one or 
more other kanji to form compounds or words. Most kokuji are used to represent 
words for indigenous Japanese plants and fish. They are also used quite frequently 


in Japanese place and personal names. 


Several hundred kokuji have been identified in Reiman (1990) and Hida 
(1990), and many were created when the Japanese isolated themselves from the 
rest of the world for approximately 250 years, from the mid-1600s to the late 
1800s. Without direct influence from China, the Japanese resorted to creating their 


own kanji. 


‘The kana scripts—hiragana and katakana—were also derived from kanji, and are sometimes 


referred to as kokuji. 
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Chapter 3 Japanese Character Set Standards 


This chapter describes two types of Japanese character sets: non-electronic 
and electronic. These collections of characters were created for different purposes 
and by different organizations, but they are related, and often affect each other’s 
development. Character sets provide a convenient way to arrange and organize the 


most commonly used characters. 


In the West, relatively small character sets are designed to include all the 
characters used to express a writing system. In Japan (and in China and Korea), 
characters, mostly kanji, number in the tens of thousands. This makes it nearly 
impossible for a character set to contain all characters, and the principle of the 
character set, namely a selection of characters made with some specific purpose in 


mind, is a matter of practical necessity. 
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Non-electronic Character Sets 


Non-electronic character sets in Japan were designed for use in education, 
rather than in the computer field. Kana and Roman characters form sufficiently 
small character sets, but since kanji number in the tens of thousands, there was a 
need to limit the number in common use. These non-electronic character sets 
provided a way to limit the number of kanji in general use, and as such, contain 


only kanji. The largest set contains 1,945 kanji. 


Joyo Kanji CH FIGS ‘Everyday-use Kanji’) contains 1,945 kanji that are 
required for daily life in Japan, such as for newspapers and other forms of media. 
Prior to 1981, this set contained 1,850 kanji, and was called Toyo Kanji (4 HAF 


‘Common-use Kanji’). 


Gakusht Kanji (##v2F ‘Educational Kanji’) is a subset of Joyo Kanji, and 
contains 1,006 kanji. These kanji are formally taught during the first six years of 
education in Japan. Prior to 1992, this set contained 996 kanji. This set once 
contained only 881 kanji, and was called Kyodiku Kanji (ACY ‘Instructional 


Kanji’). 
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Joyo Kanji can be used for writing many Japanese given and surnames, and 
Jinmei-yO Kanji (A4%/ AWS ‘Personal-name-use Kanji’) is a set of kanji that 
supplements JoyO Kanji, and authorized for use in writing Japanese given and 
surnames. Currently, Jinmei-yo Kanji is comprised of 284 kanji. Prior to 1990, this 


set contained 166 kanji. 


Electronic Character Sets 


Electronic sets were specifically designed for use with computer systems, 
and contain kanji, kana, Roman characters, and other assorted characters and 
symbols necessary for expressing Japanese in writing. Non-electronic sets constitute 
a subset of these sets. The total number of kanji in the electronic sets comes to 
12,156, all unique—approximately six times the number of kanji found in the 
non-electronic sets. The electronic sets include JIS X 0201-1976 (157 non-kanji), JIS 
X 0208-1990 (6,355 kanji and 524 non-kanji), and JIS X 0212-1990 (5,801 kanji and 


266 non-kanji). 


JIS X 0201-1976 


The JIS X 0201-1976 character set contains two collections of characters: 


JIS-Roman and half-width katakana. JIS-Roman represent the characters necessary 


16 
for English-language documents, such as the lowercase and uppercase letters of 
the English alphabet, numerals, punctuation, and symbols. Half-width katakana 
represent the minimum number of characters necessary to express Japanese using 
kana. JIS-Roman enumerates 94 characters, and half-width katakana enumerates 63 


characters. 


JTS X 0208-1990 


The JIS X 0208-1990 character set represents the most basic electronic 
character set standard in Japan that supports kanji. A total of 6,355 kanji are 
enumerated, and are separated into two levels: JIS Level 1 kanji (2,965 kanji 
arranged by reading) and JIS Level 2 kanji (3,390 kanji arranged by radical then 


number of strokes). 


Table 3-1 provides examples for each of the character classes contained in 


the JIS X 0208-1990 standard. 
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Table 3-1: Sample Characters from the JIS X 0208-1990 Character Set 


Symbols voy me CL EPI ~ PARES PTS I) 
Numerals 0123456789 

Roman characters ABCDEPGHIJI ~ qrstuvwuey=z 
Hiragana BHOWVSADRAZABS ~ VENAHDDABEN 
Katakana P7744 G0az LAT ~~ UNV Uy Vase Sy 
Greek characters ABTAEZHOIR ~ ox ported xy%ya 
Cyrillic characters ABBrOAEEK3HN ~ puiwmmbibawa 
Line elements —| 524+ +t w~ he tbh 4H 
JIS Level 1 kanji TM GEE Bi] PLA PAAR EE ~ RE TERA PRLS Dili 
JIS Level 2 kanji ASAT UES FE) Safe ~ GBR Se 
Additional kanji Sah ite HERR 


This character set standard was first established on January 1, 1978 as JIS C 
6226-1978, modified for the first time on September 1, 1983 as JIS C 6226-1983 (on 
March 1, 1987, JSA redesignated this standard as JIS X 0208-1983—no substantive 
changes were made), and modified for a second time on September 1, 1990 as JIS 


X 0208-1990. 


JIS X 0212-1990 


A supplemental Japanese character set standard was published by JSA on 
October 1, 1990, and specified 6,067 characters, 5,801 of which are kanji. This 
collection of characters is in addition to those found in JIS X 0208-1990—together 


they enumerate 12,156 kanji. The 5,801 kanji in JIS X 0212-1990 are arranged by 


18 
radical then number of strokes as a single group—like JIS Level 2 kanji of JIS X 


0208-1990. 


Table 3-2 provides examples for each of the character classes contained in 


the JIS X 0212-1990 standard. 


Table 3-2: Sample Characters from the JIS X 0212-1990 Character Set 


£2 8 ©O®™ ON 
CLCogviba 


Symbols 2% ‘ | 
Greek characters A'EHTI'OTT Qa ~ ne 
Eastern European BresSi I Jibbo ~ siitjmenky a 
Alphabetic characters ADHULLNGOEF ~ iutiwyyyz2zz 
Supplemental kanji BLT ABI | Y ~ See ae 2a ak ik ae 


Cy YM 23- 


Character Set Relationships 


The non-electronic character sets in Japan, of course, predate the electronic 


ones, and they affect the development of the electronic sets in various ways. 


First, JSA decided that the entire set of 1,850 kanji found in Toyo Kanji must 
fall within the JIS Level 1 kanji set (2,965 kanji) of JIS C 6226-1978. In 1981, 95 


kanji were added to the 1,850 to Toyo Kanji, which now contains 1,945 kanji, and 
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is now called Joyo Kanji. Some adjustments to JIS C 6226-1978 were necessary, 


and were folded into JIS C 6226-1983 (now called JIS X 0208-1983). 


Second, JSA decided that the entire set of 284 Jinmei-yO Kanji must fall 
within either JIS Level 1 or 2 kanji. Due to additions made to the Jinmei-yo Kanji 
set at two different periods (46 kanji added in 1981, and 118 kanji added in 1990), 
slight adjustments and kanji additions were made to the 1983 and 1990 versions of 


JIS X 0208. 
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Chapter 4 Kanji Simplification 


Kanji Allographs 


Japanese kanji quite often have more than one form. These allographs fall 
into one of three categories. The traditional form CIE seiji) is the allograph that 
was codified by Zhang et al. (1716), and is also called the fully unsimplified or 
orthodox form. The standard form GHJH‘F tsaydji) is the allograph that is 
acknowledged as being in common use. The alternative forms GE(S itaiji) are 


the allographs, generally simplified, that are considered non-standard. 


The alternative forms are subcategorized into the simplified form (li§ 


i= 


ryakuji), the variant or vernacular form (f@ zokuji), and the handwritten 


abbreviation (42 3-lI§F bissha ryaku/i). 


Za 

The standard form is of course usually the traditional form, but sometimes 

is actually an alternative form that was used so widely that it has, in effect, 
supplanted the traditional form. Of course, alternative forms are usually considered 
non-standard. Many traditional forms that have standard simplified forms are also 


considered non-standard. 


Kanji Usage 


Kanji originally evolved in China, and were subsequently adopted in Korea 
and Japan. Today there are five major countries that use kanji: PRC, ROC, Hong 


Kong, Japan, and Korea. 


ROC, Hong Kong, and Korea use the traditional forms of kanji. Kanji as 
used in these countries are not discussed in this dissertation because they do not 


involve a simplification process. 


Japan makes use of both simplified and traditional kanji forms. 1,006 kanji 
are formally taught during the first six grades of school—this is the Gakusht: Kanji 
set. PRC makes use of highly-simplified kanji, and, in contrast to Japan, over 3,600 
kanji are formally taught during the first six grades of school. Table 4-1, taken 


from Endo (1986: 38), provides more detailed figures. 
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Table 4-1: The Number of Kanji Learned in PRC and Japan 


Grade PRC Japan 


1 750 80 
2 850 160 
3 600 200 
4 500 200 
5 500 1385 
6 400 181 


TOTAL 3,600 1,006 


The Chinese learn approximately 3.5 times more kanji than the Japanese 
during the first six years of education. According to Endo (1986: 38), the cognitive 
skills required for learning this larger set of kanji is not a factor of 3.5, but rather a 


factor greater than 12. 


Kanji that do not have a complex structure are not, in general, simplified. 
Examples of such kanji include those that represent numbers, as illustrated in 


Table 4-2. 
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Table 4-2: Kanji That Do Not Simplify 


Gloss 
one 
two 
three 
four 
five 
SIX 
seven 
eight 


nine 


rea gt Bill | 6 


ten 


Kanji simplification mainly eases the demands of writing by hand, but 
represents a net decrease in efficiency as far as mechanical Gncluding electronic) 
reproduction is concerned. Miller (1967: 136-7) and Unger (1987: 70) point out 
that simplification may complicate matters as long as it is necessary to recognize 
both the simplified and traditional forms. Table 4-3 provides examples of simplified 
and traditional kanji forms found in the same character set standard, namely JIS X 


0208-1990. 


Table 4-3: Simplified and Traditional Kanji Pairs in JIS X 0208-1990 


Simplified Location Traditional Location 

JIS Level 1 Ba JIS Level 2 
- JIS Level 1 eh JIS Level 2 
a JIS Level 1 ie JIS Level 2 
JK JIS Level 1 a JIS Level 2 
ka JIS Level 1 oH JIS Level 2 
LN JIS Level 1 fie JIS Level 2 
[x JIS Level 1 [a JIS Level 2 
a JIS Level 1 =] JIS Level 2 


Note that the simplified kanji allographs are located in JIS Level 1 kanji 
(frequently-used characters), and that the traditional kanji allographs are typically 
found in JIS Level 2 kanji (rarely-used characters). This is true for many more 


traditional/simplified kanji pairs. 


At least in Japan, few people were required to read and write kanji a few 
hundred years ago, but now that the public is required to learn and use kanji, a 
need arose to simplify and standardize their shapes. Simplification is a way to 


allow more people to learn more kanji. 
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Kanji Simplification Principles 


When kanji simplifications are analyzed, it is possible to recognize particular 
principles. And, as we will see in Chapter 5, these simplification principles can 
then be applied to kanji to create new forms. Some of these principles include 
homophonic substitution, cursive simplification, outlining, shape extraction, and 


symbolization. The information in this section is taken from Endo (1986). 


Homophonic substitution. A complex kanji is replaced by a kanji with a much 
simpler structure, but with the same reading. An example is the kanji #§ Chow 
many,’ ‘some’) being replaced by JL Cdesk,’ ‘small table)—both share the reading 
ki. Another example is the kanji #Z Cdry’) being replaced by - Cdry’)}—both share 
the reading kan. This type of simplification is widely used in PRC, and serves to 


reduce the number of kanji. 


Cursive simplification. Either the entire kanji shape, or a portion thereof, is simplified 
in a way that would closely represent a cursive writing style. An example is the 
kanji Ed (Roku ‘country’) being simplified into the shape El. The shape 3%, when 
written quickly with a brush or other writing instrument, looks somewhat like the 


shape =k. 
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Shape extraction. A portion of the kanji is preserved in the simplification process—the 
remaining portions are removed entirely. An example is the kanji 2 (sei ‘voice’) 
being reduced to the shape . The upper-left portion is the extracted part. 


Another example is the kanji #% (go ‘sign’) being reduced to %. 


Outlining. Kanji that have enclosing shapes, such as boxes, can be simplified by 
eliminating some or all of the strokes within such an enclosure.’ An example is 
the kanji [Mi (men ‘surface’) being reduced to (1. This particular example is taken 


from the second kanji simplification in PRC (to be discussed in Chapter 5). 


Symbolization. A portion of a kanji, usually the complex portion, is more or less 
arbitrarily replaced by another shape. An example is the shape [ii (Ru ‘district,’ 


‘ward’) being reduced to . 


Kanji Simplifications in Japan and PRC 


If we compare the kanji used in PRC and Japan, we find that there are 


several interesting categories of difference and similarity. 


7Outlining can be considered a subcategory of the shape extraction principle. 


af 
No simplification in either PRC or Japan. Kanji in this category are typically simple 


by design, and thus never required simplification. Table 4-4 provides some examples. 


Table 4-4: No Simplification in Either PRC or Japan 


Gloss 
character 

up 

middle 

down 

sun, day 
moon, month 


op ED ay he ae 


big 


td 
” 


small 


| 


one 
two 


| | 


Simplified form in PRC, but traditional form in Japan. Kanji in this category have 


been simplified in PRC, but not in Japan. Table 4-5 provides some examples. 
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Table 4-5: Simplified Form in PRC, But Traditional Form in Japan 


Traditional Form 


1 oe } YW wm 


‘ 
A 


gE a A 


PRC Simplification 


Bee aM HK AK SS 


Gloss 
Seeing 
shell 

long 

time, bour 
head 
horse 
learning 
lying 
muddy, vague 
cocoon 


Many of the kanji in this category constitute radicals, and are found as parts 


of many other kanji. This causes simplification to propagate to many other kanji. 


Table 4-6 lists some examples of this phenomenon. 


Table 4-6: Simplified Radicals Propagating to Other Kanji 


Traditional Radical 
Bh 
A 
RS 


Simplified form in Japan, but traditional form in PRC. Kanji in this category have 


been simplified in Japan, but not in PRC. These serve as counter-examples against 


Examples 
FL 
Fae ie et 
Be SEER EK 


Simplified Radical Examples 
Sil TEAL LANL 
yl WE BENE 
y 25 SEG a 
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the general notion that kanji simplifications in PRC are more extreme than in 


Japan. Table 4-7 provides some examples. 


Table 4-7: Simplified Form in Japan, But Traditional Form in PRC 


Traditional Form Japanese Simplification Gloss 

- me black 

He FF speech, dialect 

AF FF speech, dialect 

ii ts paying 

i {hs Buddhism 

i i previously, beforehand 
{Ex isa false, temporary 


Note how the two kanji, #/# and ¥f#, were collapsed into the single simplified 


Japanese form, namely ¥f. There was no such simplification or collapse in PRC. 


Japan and PRC both use simplified forms, but differently simplified. Kanji in this 
category have been simplified in both PRC and Japan, but to seemingly different 
degrees: the PRC form is generally more simplified. Table 4-8 provides some 


examples. 
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Table 4-8: Japan and PRC Both Use Simplified Forms, But Differently Simplified 


Traditional Form Japanese Simplification PRC Simplification Gloss 





ct cid W Asia 

ra ray wi both 

al bat ii tooth 

ie # A becoming 
{A {ili it value 

ing JK & wide 

ie Ve J child 

PH FB] enclosing 
ip Wigs IA returning 
cE i Lil group 


Same simplified form in both PRC and Japan. Kanji in this category have the same 


simplified form, usually through borrowing. Table 4-9 provides some examples. 
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Table 4-9: Same Simplified Form in Both PRC and Japan 


Traditional Form Simplified Form Gloss 


ft the present 

[ars DX district, ward 
@ a meeting, society 
a jai voice 

BL aL disordered 

He 4s body, form 

iN ts body, form 

a 4s body, form 
country 

at Ka point 

= a stand, counter for vebicles 
@ fs former 
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Chapter 5 Prescriptive Simplification 


This chapter introduces the concept of prescriptive kanji simplification, and 


shows how it relates to spontaneous and descriptive kanji simplification. 


Three Cases of Kanji Simplification 


Spontaneous Simplification 


Kanji are generally simplified spontaneously through imitation and frequent 
use. Writers create these simplifications, which spread to the public through 
imitation. It is generally the responsibility of an official organization, such as 
Japan’s Ministry of Education (SCHb44 Mombush6), to decide if and when a simplified 


form becomes standard. 
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Many kanji that are spontaneously simplified never become standard forms 

even though they are routinely used in certain special applications. For example, 
the kanji (vo ‘filter’), as it occurs in the two compounds HK (roshi ‘filter 
paper’) and 7Fif4 (roka ‘filtration’), is the simplified form of %a. Even though this 
simplified kanji is used frequently in some fields of science, such as chemistry and 


biochemistry, it is not found in either JIS X 0208-1990 or JIS X 0212-1990.® 


Descriptive Simplification 


It has become increasingly important to create character sets in order to 
standardize the number and shapes of kanji in common usage. A government 
body, such as a standardization organization, usually undertakes this task. In 
Japan, this is accomplished by JSA. The kanji shapes of a subset, namely Joyo 


Kanji and Jinmei-yO Kanji, are controlled by Japan’s Ministry of Education. 


Descriptive simplification occurs when this government body merely catalogs, 
records, or describes the way in which the people have simplified kanji. This is 
similar to spontaneous simplification, but includes the additional process of 


standardizing a particular form. 


I have found this simplified kanji in XCCS (Xerox Character Code Standard) and in the Unicode 
character set. 
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Prescriptive Simplification 


So far the discussion has treated matters of common knowledge, but in 
Japan there has been another qualitatively different process of kanji simplification 


which has not yet been noted in the literature. 


Prescriptive simplification is a way to force change, and is substantially 
artificial. In this case, simplification principles are extracted from simplified kanji 
(that is, kanji that have been spontaneously simplified), then applied to kanji. In a 
way, this can appear to be a method to standardize kanji shapes, but many kanji, 
even complex ones, have no compelling reason to be simplified because they are 


not used frequently enough to justify simplification. 


Phenomena in Japan 


The first phenomena that we examine come from Japan. This is the matter 
of the kanji simplifications brought forth with the Joyo Kanji set in 1981, and how 
these simplifications were extended to kanji in the JIS X 0208-1983 standard. The 
silent restoration of kanji to the JIS X 0212-1990 standard provides testimony for 
my claim that prescriptive simplification took place during the transition of JIS C 


6226-1978 into JIS X 0208-1983. 
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Joyo Kanji Simplifications 


The Joyo Kanji set, which consists of 1,945 kanji, is the set of kanji that has 
undergone the most simplification in Japan. The JoyO Kanji set contains the basic 
kanji required to read most Japanese texts. This means that these 1,945 kanji are 
among the most frequently used. As was asserted in Chapter 4, kanji that are used 
very frequently are generally simplified. This is exactly what happened in the case 


of Joyo Kanji. 


The simplified kanji found in Joyo Kanji were recognized and utilized by 
the Japanese Ministry of Education in order to make the basic set of kanji easier to 
learn. The simplified forms specified by the Japanese Ministry of Education may 
have been a departure from previous standards, but were in common use as Miller 


(1967: 136) points out. The government merely cataloged them. 


Joyo Kanji Simplifications Applied to JIS X 0208-1983 


What is unmistakenly a case of prescriptive simplification can be found in 
the transition from JIS C 6226-1978 to JIS X 0208-1983 Gn clearer terms, between 
the 1978 and 1983 versions of the JIS X 0208 series). This issue is still being 


debated to this day in some circles. 
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In Chapter 3 we learned that the 1,945 Joy6 Kanji are a subset of the 2,965 

JIS Level 1 kanji. One of the requirements for the JIS X 0208 series is to keep pace 
with the development of the non-electronic character sets. This means adding 
more kanji when necessary, but it also means adhering to the kanji simplifications 
brought forth by the JoyO Kanji set. Therefore, changes made to the non-electronic 
sets often result in changes to the electronic ones. This is almost like a fourth type 
of simplification, namely authoritative simplification. The JIS X 0208 series does 


not affect the development of Joyo Kanji and Jinmei-yO Kanji—at least not yet. 


Nomura (1984: 5) describes the transition from JIS C 6226-1978 to JIS C 
6226-1983 (JIS X 0208-1983, as designated today), and enumerates six categories of 
changes made to kanji shapes along with descriptions of the changes and example 
kanji. Nomura’s third category is the one that provides evidence for prescriptive 
simplification. Its description states that JoyO Kanji simplification principles were 
applied to kanji found in JIS Level 1 kanji. JIS Level 1 kanji contains 1,020 kanji not 


found in JOy6 Kanji. 


ee 


Tables 5-1 and 5-2 provide printouts of the first row of kanji (the kanji 
begin at row 16) from the JIS X 0208 series: the original 1978 version is illustrated 


in Table 5-1, and the 1983 version with kanji allograph changes emboldened is 
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illustrated in Table 5-2. These tables are significant in that they illustrate how a 
typical row of characters was modified during the transition from JIS C 6226-1978 
to JIS X 0208-1983. These tables represent the changes to just one row of 94 


characters—there are 68 such rows of kanji in the standards.’ 


Table 5-1: Row 16 of JIS C 6226-1978 


01 02 03 04 05 06 07 08 09 10 1112 13 14 1516171819 
0 HME RERBERHABEEBRILE FE B 
20 PE HE BR HR 3c HH We AB Ry RE AK BR BE 4G 2 ee PKG ZR OA 
RAO FMKEARAERARE BRR DGRARER 
OPM MA ABKHEREBRHKZRRRR-— Ela 
80 KF MRA A AWS Ke Ae 


Table 5-2: Row 16 of JIS X 0208-1983 


01 02 03 04 05 06 07 08 09 101112 13 1415 16 1718 19 
00 HM LER RE RGEC RAM RR E IRS 
20 #F HE Rt ie Se WE HC Ai ity) We AG Be SR AG BS YAS TK Me SR 
OMB DR MAKER ARAM ARR DA RR 
ocBM MAB KAR REH KM A MR Site 
80 fia =F BS Ft Fl Oe OM we S| Ok PR al 


*The vertical and horizontal axes of Tables 5-1 and 5-2 are added together to obtain the cell value 
for a character—the row value for both tables is 16. For example, the cell value for the kanji #* is 
70 (60 plus 10), which forms a KUTEN value of 1670. 
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The first two kanji from row 16 above exemplify simplification applied to 

kanji not part of the Joyo Kanji set. The very first kanji in the JIS C 6226-1978 (and 
JIS X 0208-1983) standard is Hf, a simplification of Ht. This character is also a 
component of many other kanji, but only in its traditional form 4#, as in the very 
next kanji in the array, [ii. However, in JIS X 0208-1983, the simplified form Hf 
was introduced into Mi, and in many other kanji of which 5 was a component. 
Further examples of this policy of carrying the simplified forms of Joyo Kanji 
consistently through characters of which the original form was a component are 


illustrated in Table 5-3. 


Table 5-3: Simplification Principles That Affect Other Kanji 


Simplification Principle 1978 Allograph 1983 Allograph 


i => Mg Ie 
B= # ja ke 
ad => ia i 
aE => 38 ee ia 

oo pia 
=> 7 if be 
A= #4 ile ff 
m= FF ihe tf 
=> BE He 
fz => Hie iN 

lat BE 
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The same simplification principle can affect more than one kanji, such as 
the principle a = K causing 4 to reduce to 4K, and Mi to reduce to BB. Some 


kanji are affected by more than one simplification principle, such as jis# reducing to 


i. 


Evidence in JIS X 0212-1990 


In the JIS X 0212-1990 standard, 28 simplified kanji of the JIS X 0208-1983 
standard were restored to their allographs as found in the JIS C 6226-1978 standard. 
Page 81 of the JIS X 0212-1990 manual contains a table that lists 28 of its kanji that 
were once found in JIS C 6226-1978, but disappeared in JIS X 0208-1983."° One 
such example is lit!, which was reduced to lf in JIS X 0208-1983. A table illustrating 
all of these 28 simplified/traditional kanji pairs is found in Appendix C. All the 
traditional kanji allographs listed in Table 5-3 (the 1978 allographs) are among 


these 28 restored kanji. 


The restoration of these 28 kanji in JIS X 0212-1990 amounts to an admission 
on the part of JSA that is was a mistake to carry simplified JoyO Kanji through 


compound characters, in which the simplexes were a component. 


This table in the JIS X 0212-1990 manual contains only these 28 kanji, nothing more. 
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At a recent JSA meeting, Kazuo Tajima" (FHIB—X Tajima Kazuo) of Iwaki 
Meisei University (v4) = HHEK Iwaki Meisei Daigaku) made a statement to the 
effect that care needs to be taken so that similar errors, like those made during the 


transition between the 1978 and 1983 versions of JIS X 0208, do not occur again.” 


Relative Acceptance of Kanji Simplifications 


People who do not use rare kanji forms more readily accept prescriptively 
simplified kanji—only due to the fact that they do not generally use such kanji, 
whether in traditional, simplified, or otherwise variant form. The average Japanese 
uses approximately 2,000 kanji in daily life, but there are 12,156 kanji available in 


the JIS X 0208-1990 and JIS X 0212-1990 standards. 


Notwithstanding the rationale of the simplification program embodied in 
the 1983 character set, people and organizations that actually make use of 
prescriptively simplified kanji generally do not care for them. This includes type 


foundries, publishers, and scholars. These people are familiar with traditional kanji 


"Kazuo Tajima served as the chairman of the committees for the JIS X 0208-1990 and JIS X 
0212-1990 standards, and is a leading figure in the field of Japanese character sets. 


The source of this information is personal communication with Taro Yamamoto (UA AEB 
Yamamoto Taro) from Adobe Systems Japan. 


4] 


forms, and prefer to use them. Strangely, these same people accept the Joyo Kanji 


simplifications. 


Some companies, such as Fujitsu Limited, IBM, and NEC have preferred to 
retain most of the JIS C 6226-1978 kanji allographs for their proprietary character 
sets. This means that the Japanese fonts used on computer systems manufactured 
by these companies make use of the allograph Mi (the 1978 allograph) rather than 


lif (the 1983 and 1990 allograph—of the JIS X 0208 series). 


Japanese type foundries, which have a much larger repertoire of character 
forms at their disposal, typically have both kanji allographs, such as Mii and Mi, 
available on their proprietary typesetting systems. This is to accommodate those 
that accept the allograph Mii and those that accept "i. Morisawa’s CEU UT 
Morisawa) proprietary code sets are called MOR-CODE I and MOR-CODE II. 
Shaken’s (5-Wt Shaken) are called SK-72 and SK-78. FDPC or the Font Development 
and Promotion Center (X74 bHH# > BRUYAY— Mofji Fonto Kaihatsu 
Fukyti Senta), the JSA-sponsored consortium that produces the Heisei CFB heisei)” 


series Japanese typefaces, also provides variant kanji allograph data, on the order 


of 1,000 additional kanji variants, per typeface. 


Heisei’ is the name of the current era in Japan, and began in 1989. 
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Treatment in Corporate Character Sets 


Japanese corporations generally follow the Japanese national standards, 
namely the character sets JIS X 0201-1976, JIS X 0208-1990, and JIS X 0212-1990. 
However, Japanese corporations either provide additional characters, or use 
allographs not specified in the latest JIS standard, and it is these latter which are 
under discussion here. Information in this section can be considered further 
evidence and testimony suggesting prescriptive simplification, at least for the 28 


kanji restored in JIS X 0212-1990. 


Fujitsu Limited’s corporate character set standard, FMR Kanji, follows the JIS 
X 0208-1990 standard—the latest of three versions—yet retains many JIS C 6226-1978 
kanji allographs, including all 28 restored in JIS X 0212-1990. Fujitsu Limited has 
been very careful to use the most correct and standard kanji forms in its products. 
This is because they are a leading supplier of computers and software to the 
Japanese government. Data that show JIS C 6226-1978 kanji allographs in a JIS X 
0208-1983 character set can be found in Fujitsu (1989). Consulting the latest 
manuals and personal communication via e-mail with Masahiro Sekiguchi, Fujitsu 
Limited’s leading authority on their character sets, indicated that FMR Kanji is now 


based on JIS X 0208-1990, and still retains the same JIS C 6226-1978 kanji allographs. 
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The IBM Japanese character set, called DBCS-PC, is similar to Fujitsu Limited’s 
FMR Kanji in that it is based on JIS X 0208-1990, yet still retains many JIS C 


6226-1978 kanji allographs.”’ 


Slightly less remarkable evidence can be found in NEC’s corporate character 
set, namely NEC Kanji. To this day, the standard character set used on NEC 
machines is still based on JIS C 6226-1978.” This means that all 28 kanji allographs 
restored in JIS X 0212-1990 are part of NEC Kanji. Preference for the JIS C 
6226-1978 kanji allographs may be one reason why NEC has not adopted the latest 


version, namely JIS X 0208-1990. 


Treatment of These 28 Kanji in Kanji Reference Works 


I have consulted five kanji reference works that I consider to be authoritative. 
Table 5-4 on page 46 summarizes my findings, and provides evidence to support 
my claim that most of these 28 simplified kanji under discussion did not exist prior 


to their introduction into the JIS X 0208-1983 standard. 


“DBCS is an acronym for Double Byte Character Set. PC is an acronym for Personal Computer. 


“However, the standard Japanese character set on the Japanese version of Windows, both NEC 
and Microsoft versions, is based on JIS X 0208-1990. It is the core system font of the non-windowing 
environment that is still based on JIS C 6226-1978. 
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One of the oldest authorities on kanji, the Koki Jiten (EE BRS ‘The Koki 
Character Dictionary’), also known as Zhang et al. (1716), is Chinese, not Japanese, 
in origin.” Zhang et al. (1716) includes all 28 traditional kanji allographs, and two 


of the simplified ones, which are expressly treated as variants. 


Dai Kanwa Jiten (Ki@F FER ‘Large Kanji Dictionary’), also known as 
Morohashi (1955-1960), like Zhang et al. (1716), provides all 28 traditional kanji 
allographs. In addition, three of the simplified kanji allographs are in its pages, 


and are expressly treated as variants of the traditional allograph. 


Shin Jigen (#tF-WR ‘New Character Origins’) is a small, yet authoritative 
kanji reference work in regard to the treatment of kanji allographs. It is known as 
Ogawa et al. (1968). It contains all 28 of the traditional kanji allographs, but only 


five of the simplified ones, all of which are expressly treated as variants. 


Dai Kango Rin (Ki®###K ‘Large Chinese Word Forest’), written by two 
students of Morohashi (1955—1960), is an authoritative contemporary kanji reference 
work. It is known as Kamada & Yoneyama (1992). This reference work is unique 


in that it has traditional kanji dictionary features, yet covers all 12,156 kanji 


‘In Chinese the name of this dictionary is read Kangxi Zididn. 
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allographs that are enumerated in JIS X 0208-1990 and JIS X 0212-1990. All 28 
traditional/simplified kanji pairs are included, by virtue of the fact that both are 
found in the union of the two JIS standards listed above. Kamada & Yoneyama 
(1992: 10), in fact, acknowledges this fact with five examples. However, the 
simplified allographs are listed as either variants or simplifications of the traditional 


kanji allographs. 


Although it does not contain nearly as many kanji as other kanji reference 
works, the New Japanese-English Character Dictionary (RR FIL Shin Kan’ei 
Jiten), known as Halpern (1990, 1993), is very accurate. Seven of the 28 traditional 
kanji allographs are listed, but none of the 28 simplified ones are, not even as 


variants. 


Table 5-4 illustrates which of the 28 simplified kanji allographs are found in 
the first four cited reference works. The fifth, Halpern (1990, 1993), is not included 


in this table because it does not contain any of the 28 simplified kanji allographs. 
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Table 5-4: The Treatment of 28 Simplified Kanji in Kanji Reference Works 


JIS X 0208-1983 Kokijiten Dai Kanwa/Jiten Shin Jigen Dai Kango Rin 


Pe 2202 variant variant variant variant 

if 1602 variant variant 

Ii 1990 simplified 
#S 3925 simplified 
fi 3722 simplified 
kB 2840 variant variant variant 

fi 3363 simplified 
ff] 3647 simplified 
#8 5825 variant simplified 
BE 4014 variant 

He 3834 simplified 
i 1775 simplified 
#2 3373 simplified 
# 3788 variant simplified 
i 2911 simplified 
YE 2350 simplified 
$€ 4573 simplified 
*F 3053 simplified 
iH 4725 simplified 
4X 2277 simplified 
tf 3063 simplified 
WE 4016 simplified 
i 4343 simplified 
Ha 3731 variant variant variant 

BE 1810 simplified 
wk 2420 simplified 
KO 2577 simplified 
Wl 4445 simplified 
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It is important to emphasize that all 28 simplified kanji allographs are in Kamada & 
Yoneyama (1992) by virtue of the fact that it was designed to include all 12,156 


kanji from JIS X 0208-1990 and JIS X 0212-1990. 


Table 5-4 argues that the majority of these 28 simplified kanji allographs 
never existed until they were created by JSA for the JIS X 0208-1983 standard. 
Many of these simplified allographs, such as (%, #2, and #4, demonstrate consistent 
treatment as variants in these kanji reference works. The remaining simplified 


kanji, which appear nowhere in any authority, are substantially artificial. 


The February 1, 1994 issue of The Yomiuri Shimbun included an article 
entitled “Koseki Kompyuta-ka de Goji Zokuji wa Teisei-shimasu” (FP #§4 YE a— 
FAG CHE + RE FILETIEU EF ‘Incorrect and Variant Characters to Be Corrected 
by the Computerization of Family Registration’). This article describes a Japanese 
government plan to forcibly correct kanji used in Japanese personal names. 155 
kanji are treated as exceptions to this plan, and are separated into two categories. 
The first category contains 15 kanji pairs, and the simplified form of each is 
allowed in existing names, but not for future names. The second category contains 


140 kanji pairs, and the simplified form of each is allowed for existing and future 
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names. Of the 28 simplified kanji in this discussion, 25 were in this article’s second 
category. Only the following three simplified allographs were not listed: B, #, 


and YE. 


This article is missing some important information, such as reasons why 
these 155 kanji are being treated as exceptions to the plan. Table 5-4 provided 
evidence which shows that most of these 28 simplified allographs did not exist 
prior to their introduction into JIS X 0208-1983, so why is their existence being 
acknowledged? I believe that the answer is similar to the reason why all 28 
simplified allographs appear in Kamada & Yoneyama (1992), namely that in 1990, 
JSA decided to leave these 28 simplified allographs in the JIS X 0208 series, and 
restored their traditional allographs in JIS X 0212-1990. My claim here is that these 
incorrect forms propagated to public use through the widely-implemented JIS X 
0208-1983 and JIS X 0208-1990 standards (both of which use the same 28 simplified 
allographs). The typical Japanese assumes that the kanji allographs available for 
input on their computer system are the correct forms. The success of computerizing 
family registration hinges on the availability of characters in electronic character 


sets, which is a concrete, not theoretical matter. 
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Although I have shown that a few of the 28 simplified kanji allographs have 
existed as variants before 1983, I still maintain that these kanji were created 
through the principles of prescriptive simplification, at least for the JIS X 0208-1983 
standard. However, it is difficult to find hard evidence to back up this claim. The 
JIS X 0208-1983 standard committee notes might be one possible source of such 


evidence. 


Kanji Restored in JIS X 0208-1983 and JIS X 0208-1990 


Although not direct evidence for prescriptive simplification, many traditional 
kanji allographs were restored in JIS X 0208-1983 and JIS X 0208-1990. This means 
that variant or simplified kanji allographs originally occupied a given code point in 
the character set, and in either 1983 or 1990, such forms were substituted with 


their traditional allographs. 


The transitions between earlier and later versions of the JIS X 0208 series is 
generally viewed as involving only simplification or the adoption of variant kanji 
allographs. My research shows that the restoration of traditional kanji allographs 


was also part of this transitional process. 
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Examining the transition between JIS C 6226-1978 and JIS X 0208-1983 
found 18 traditional kanji allograph restorations. A similar study of the transition 
between JIS X 0208-1983 and JIS X 0208-1990 uncovered seven. The kanji reference 
works listed in the previous section were used to verify traditional and variant 


kanji allographs. Tables 5-5 and 5-6 detail my findings. 


Table 5-5: Kanji Restored in JIS X 0208-1983 


JIS C 6226-1978 (Variant) JIS X 0208-1983 (Traditional)”’ 
f8 2346 8 
KR 2368 be 
Fi 2523 ia 
i =©62684 Alt 
Ht 2725 Sit 
fll 3347 fill 
#3 3830 ay 
Wi 3853 Wiki 
tf 3860 ie 
TE A548 TS 
Me = 5175 Hin 
#2 5984 Fe 
Mi §=6752 fii 
it §=7673 Bt 
@ 8037 4 
BA 8085 Te 
fl 8213 faa] 
4h 8393 4 


"The allographs in this column also correspond to the forms found in JIS X 0208-1990. 
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Table 5-6: Kanji Restored in JIS X 0208-1990 


JIS X 0208-1983 (Variant)"* JIS X 0208-1990 (TraditionaD 
i 5024 4 
HE 6089 te 
HE 6165 BE 
Yi 6254 1 
hE 67125 is 
MR 7485 ae 
ff 8247 ihe 


Tables 5-5 and 5-6 show that JSA attempted to improve the JIS X 0208 
character set during both revisions by restoring traditional kanji forms. Nomura 
(1984: 5) makes reference to this restoration process during the transition of JIS C 
6226-1978 to JIS X 0208-1983, but groups such restorations into a category of 


miscellaneous changes—not terribly helpful to the reader. 


Fujitsu Limited’s FMR Kanji and IBM’s DBCS-PC have always followed the 
traditional kanji forms as provided in Tables 5-5 and 5-6, even before the 1983 and 


1990 versions of JIS X 0208 were released.” 


The allographs in this column also correspond to the forms found in JIS C 6226-1978. 


“The kanji allograph fA (the JIS X 0208-1983 allograph) was already in an encoding region of 
IBM’s DBCS-PC that does not correspond to the JIS X 0208 series (called JBM Select Kanji). The 
kanji allograph 71 (the JIS C 6226-1978 allograph) is found in the portion of DBCS-PC that 
corresponds to the JIS X 0208 series. 
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Phenomena in PRC 


The First Kanji Simplification 


The first kanji simplification of PRC was introduced in January of 1955. The 
kanji simplifications brought forth became official in January of 1956. The first 
kanji simplification merely standardized the way in which the Chinese people 
were already writing kanji. It was accepted—and welcomed—by the people. The 
first kanji simplification falls into the category of spontaneous and descriptive 
simplification. Example kanji from the first kanji simplification can be found in 


Chapter 4. 


The PRC’s first kanji simplification comprised 2,238 simplified kanji, separated 
into three tables.” Table 4-6 on page 28 illustrated how a simplified radical can 
propagate its simplified form to kanji that use said radical. There are two types of 
radicals that cause this: left-side (fi ben) and right-side (5# D6). The first of these 
three tables lists 352 kanji that were simplified without the propagation of simplified 
left- or right-side radicals, the second table lists 132 kanji that represent the 


simplified left- and right-side radicals themselves, and the third table lists 1,754 


All 2,238 of these simplified kanji forms are found in GB 2312-80, PRC’s standard electronic 
character set standard. GB 2312-80 contains a total of 6,763 kanji. 
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kanji that were simplified by propagating simplified left- and right-side radicals. 


Endo (1986: 30-57) provides the three tables listing all 2,238 simplified kanji. 


The propagation of simplified left- and right-side radicals to kanji that use 
such radicals can be considered prescriptive simplification, but it is the second 
kanji simplification, described next, that offers better testimony for the existence of 


this phenomenon in PRC. 


The Second Kanji Simplification 


Plans for a second kanji simplification in PRC were announced on December 
20, 1977. The Chinese Character Reform Committee (FEIN Sikh AS Zhonggu6é 
Wénzi Gdigé Wéiyudnbui)”’ analyzed the simplifications from the first kanji 
simplification, then applied these principles to other kanji. The goals of the second 
kanji simplification were to further simplify the structure of kanji, from an average 
16-strokes-per-kanji down to eight strokes, and to reduce the number of kanji by 


collapsing several kanji into a single form. 


In order to reduce the average number of strokes, many kanji which never 


had a simplified form had one assigned. According to Endo (1986: 109), the 


*1In Japanese this name is romanized as Chiigoku Moji Kaikaku linkai. 
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Chinese Character Reform Committee was the organization that decided upon the 


simplified forms. 


Reducing the average number of strokes per kanji was an important part of 
this plan in that it eases the learning of kanji. Remember that students in PRC are 


required to learn approximately 3,600 kanji during the first six years of education. 


According to Harada (1989: 206) and Endo (1986: 116), the simplifications 
proposed in this second kanji simplification are considered much more extreme 
than the first, namely that a greater degree of simplification is being applied. An 
example is the kanji [Ml (men ‘surface’) being simplified into the form M4. Another 
example is the kanji #, (shi ‘I’) reducing to Js. These simplifications follow two of 
the principles described in Chapter 4, namely outlining and shape extraction. 
Furthermore, Harada (1989: 206-207) writes that even though the public had 
become adjusted to the first kanji simplification, introducing even more simplified 


forms caused confusion. 


The second kanji simplification plan resulted in 853 simplified kanji, falling 


into two tables.” The first table contains 248 kanji, all of which have won general 


“None of these simplified kanji forms are found in GB 2312-80, PRC’s standard electronic character 
set standard. 
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public acceptance. 172 do not use left- and right-side radicals, 21 do, and the 
remaining 55 kanji were simplified through analogy. The second table contains 
605 kanji which have not been fully accepted by the public. 245 do not use left- 
and right-side radicals, 24 do, and the remaining 336 kanji were simplified through 


analogy. This data is taken from Endo (1986: 108). 


The 248 kanji that have been accepted by the public are not at issue—they 
can fall into the category of spontaneous or descriptive simplification. The remaining 
605 kanji are substantially artificial, and thus can be classified as prescriptive 


simplifications. 


Harada (1989: 207) writes that simplified kanji from the second kanji 
simplification were used in several newspapers from January of 1978, but by June 
of that year, they were no longer being used. In January of 1986, the Chinese 
National Language Character Construction Committee (PRIRIZ#& 8 MF LVFB ES 
= Zhonggu6 Guojia Yiuyan Wénzi Gongzud Wéiyudnbui)” announced that the 
second kanji simplification was rescinded. The reasons given were that such 
simplifications would cause problems in producing dictionaries and the like, and 


also in computer processing of Chinese. 


*°In Japanese this name is romanized as Chaigoku Kokka Gogen Mofi Kosaku linkai. 


Appendix A 
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JIS C 6226-1978 Versus 
JIS X 0208-1983 


This appendix details the changes that occurred between the 1978 and 1983 


versions of JIS X 0208. I limit this appendix to changes in kanji—the non-kanji 


changes are not listed here. The changes made to kanji fall into three categories. 


The set of changes in category 3 are the most controversial. 


Category 1 


Four simplified kanji were substituted for the original shapes, which were 


assigned a new code position at the end of JIS Level 2 kanji in row 84. 


Original Code 
FE 2238 
Hi 4374 
j@ 4558 
EG 6486 


Added Code 
#2 8401 
H 8402 
#4 8403 
He 8404 
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Category 2 


22 simplified and traditional kanji pairs exchanged code positions between 


JIS Level 1 kanji and JIS Level 2 kanji (44 affected code points). 


JIS C 6226-1978 JIS X 0208-1983 

JIS Level 1 JIS Level 2 JIS Level 1 JIS Level 2 
f® 1619 ff 8245 ff 1619 fs 8245 
% 1809 ee 8284 wm 1809 4 8284 
WE =6«1934 Wi 7358 i §=61934 i =67358 
1941 fd =5788 fe 861941 ft 625788 
fi 1986 fi 6762 fi =1986 fa 6762 
¥E = =2035 YE 6285 Ye 2035 WE ©6285 
@ 2050 BR 67561 # © 2050 2 87561 
8A 2359 38 «68084 3A 2359 SA 8084 
i 2560 Wk 6672 fk 2560 We 6672 
#S 2841 && 7302 2841 # 7302 
M3157 #) 8055 HY) = 3157 #8055 
HE 3308 HE 7645 RE 3308 HE 7645 
a 3659 a 5268 Hf 3659 a 5268 
WA = =3755 th 6674 ih = 3755 i =6674 
i 3778 Ke 865977 Ke 3778 Re 5977 
i 3783 Y 6225 % 3783 i 6225 
2 3886 ¥& 7778 Yb 3886 @ 7778 
WE 83972 WH 867404 WE 3972 WE 67404 
i 4116 KE 5956 KE 4116 HE 5956 
fii 4389 WR 4854 (KR -4389 fii 4854 
Bo 4489 7314 #4489 Bo 7314 
fig 4722 i ©6838 f# 4722 fi 6838 


Category 3 
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The shapes of many kanji were altered, including several simplifications. 


This category of change is quite subjective, and what appears below are 250 such 


kanji. The source used here is the Adobe Japanese Character Collection (called 


Adobe-Japan1-0). The asterisked items are the 28 kanji allographs that were restored 


in JIS X 0212-1990. Those annotated with hash marks are traditional form restorations. 


1978 1983 

Ni i =§=61602* 
% 4 1609 
= FF 1618 
ff #&f 1627 
Yi Ym 1678 
fj =f}: 1683 
we YW 1692 
ma i LID 
fe  & 1721 
fe ie 1725 
Mm Mm 1729 
fis Ss«1734 
fA RA s1:775* 
fA = LC «1808 
He 2 1810* 
wi 6p «18644 
mm th 1890 
i «=©6f )=— 1893 
Ht = =©6 BC s«-1902 
Wig WE} §=61969 


1978 1983 
wm OB 
Hy Ry 
1 
fal = ital 
ma 
a BL 
HB 
mx 
POR 
7 | 
a 
Si AK 
is 
i A 
a 
| 
ms = -# 
on 
i 
| OR 


1975 
1983 
1990* 
2034 
2045 
2069 
4 Be 
2132 
Z2a0nr 
2210 
2247 
2277* 
2284 
fA 1 
2293 
2304 
2320 
2337 
2346# 
2350* 


1978 1983 
Fl iif =2353 
Bee KR 2368# 


f@ f& 2381 
he 86 = 2389 
fe 86 f& 2394 
Mae =| RA DO* 
vw 2 2433 
45 # 2511 
hm 860s 25234 
0 OY = 2534 
RE 38 = 2577* 
fie fi 2584 
fi HH 2589 
RR Se 2651 
ff = Rs 20671 
At 6 OY: Ss 26. 8.4% 
ie 8 be 2707 
fi; ff 2710 
Si 8=68f §=6©2712 
mt dW 2725+ 


1978 1983 1978 1983 1978 1983 


jf BE 2840* wR wh 3429 i tt  3788* 
we 86 2855 BH OB 3445 %  3830# 
A 62861 Be i 3460 fH 86% §63834* 
fy 3 2862 fe oR 3467 i? =o §=6-3.852 
ii 60H 291% K HK 3479 Mi WC 35 3H 
fe pH 2922 YE Ye 3485 jf WA 3859 
fe HE 2976 KK KK 3486 i WW 3860# 
YW 2977 i wh 3493 HS KS 3865 
a 62982 Se 838 3507 ae ae 3870 
# 869A = 2983 Wy = 3509 me ©6387 
"=F 3005 Ai 6 ON Ss 3510 he = RR O3874 
me 86k ©=63019 fe 3513 i t# 3909 
fA 0S #32031 @ RE 3514 ge 3 3925% 
He $$ 3053%* @ i 3529 ha@ =f 3955 
i 86 = 3063* ak #& 3580 ja je 3971 
ke 8 3068 Ri =H 3585 A FF 3973 
fh = ft «= 3110 Yl = oY] «3592 #) = #) 3977 
i 8) 3157 { $6 3629 & FF 4004 
iw j2 3164 RE = RSs 36.40 w 8 BE 4014* 
7A BR 3173 fe 86h §=63642 Bt BE 4016* 
4 848 3202 Bm BR 3645 8 8=6ff §=4052 
wi =6pe Cs 3:23. i] = fi} ss 36.47% BR #R 4066 
ie 3270 it 0k («3652 hi = hl, |S 4085 
fe #8 3281 Gi = oBSC3702 Re =f «4102 
fe t 3282 He te 3707 fi =f 4103 
Ri = RY 32289 33 6 SC i jw 4115 
fa tm 3290 a = SR 37722 re RSA118 
ii # 3307 me 86 RSC: me wm 4121 
Mm WY 3325 Sh =6S8 3740 ey iH 4132 
w= 33.44 je i 3743 B26 C4146 
fll fll 3347# S% F 3749 wi | OBA a149 
fe 8 3363* bf HR 3750 ma mK 4235 
je #8 3373* St $B 3768 em MW 4245 


1978 1983 


BSW SAAD PRRESRE RMR RNR SERS RSERRAMW SSR BAS 


L 


SM PANES RSM PRR HNRHPESESSRASREMW SSK STS 


= 
tae 


= 


4258 
4289 
A302 
A309 
4343* 
4380 
4388 
4445* 
4457 
4463 
4466 
4490 
4492 
4494 
4518 
4548# 
4552 
4573* 
4643 
4690 
A691 
4701 
4717 
4725* 
A929 
4939 
4943 
4945 
5116 
5126 
5162 
5175# 
5236 


1978 1983 
a 
5: 
OBR 
| tf 
RR 
fe HG 
mw if 
YE | OME 
i OR 
i | oh 
rn 
12H 
we OR 
7 ne 
a 
mm OR 
xz 
Fé 
rr 
we OB 
fk = AR 
OF 
OT 
: nS 
ie = R 
AF 
ae 
jy A 
HE FE 
ma 6 


5329 
5367 
5402 
5604 
5764 
5776 
5825* 
5845 
S337 
5942 
39°75 
5984# 
6248 
6406 
6463 
6511 
6516 
6517 
6611 
6679 
6742 
6752# 
6825 
6868 
6878 
6925 
6927 
6938 
7038 
7168 
ad3 
7182 
7220 


1978 1983 


eeSP RRSP oR Mee ESR SR a SR BH 


Ran 


eRe SRP RRS RMR PMR RES ESR aS aS Se 


7227 
7284 
7407 
7429 
7479 
7518 
7570 
7586 
76734 
7684 
7746 
7776 
7778 
7809 
7829 
7855 
8037# 
8049 
8055 
8085# 
8213# 
8228 
8246 
8349 
8393# 


61 


Appendix B JTS X 0208-1983 Versus 
JIS X 0208-1990 


This appendix details the changes that occurred between the 1983 and 1990 
versions of the JIS X 0208 standard. These changes are very minor, affecting 145 


kanji allographs. None of the changes are considered to be simplifications. 


The following two kanji were appended after JIS Level 2 kanji in row 84. 


These kanji represent recent additions to the Jinmei-yO Kanji set. 


# 8405 
RE g406 


The following is a complete list of the 145 kanji affected by subtle and 
slight allograph changes. Those annotated with a hash mark are traditional form 


restorations. 


1983 1990 


Be TRE RTH SRE HORE AR SERS ER 


SRR A S eS aS 


fH HM eR AR RS ASM RR ARERR SR SEO K KARR RE 


1646 
1662 
1667 
L725 
1750 
1768 
1772 
1784 
1807 
1920 
1921 
1944 
2026 
2072 
2102 
2171 
2249 
2370 
2374 
2382 
2390 
2482 
2488 
Boao 
2527 
2537 
2542 
2545 
2563 
2373 
2665 
2740 
2743 


1983 1990 


SPR ES eHY RE RE SS 


Z 


SMR Ss 


Bul 
te 


a = om + 


<a 
~ 
co 


aS = se 


pb 


2857 
2893 
2954 
2956 
3030 
3057 
3070 
3093 
3106 
3305 
3377 
3379 
3392 
3534 
3549 
i he ie BD 
3615 
3616 
3623 
3677 
3678 
3690 
S77? 
3808 
3951 
3958 
4041 
4050 
4065 
4066 
4069 
4076 
4080 


1983 1990 


SER RRSP RHWSSOK D 


cd 
= 


SBSMHMSSBSRSHRSA RS BS 


> ay 


mee a SR aa Ss SS 


a= 
=5 


HS 2ZEM HS BSS RES ER SRS BD ew 


4147 
4167 
4212 
4220 
4222 
4223 
4256 
4363 
A381 
4477'7 
4589 
4657 
4679 
4694 
5024# 
5054 
5140 
S203 
sas 
5324 
5458 
5460 
Sau5 
5549 
5617 
5714 
5819 
5841 
5880 
5932 
5955 
6031 
6050 


1983 1990 1983 1990 
6077 
6089# 
6165# 
6254+# 
6284 
6288 
6301 
6402 
6466 
6474 
6501 
6601 
6689 
6758 
6941 
6960 
7039 
7060 
7066 
7067 
7104 
TL2aF 
7243 
7384 
7436 
7449 
7474 
7485# 
7488 
7504 
fas 
4335 
7652 


7655 
7722 
7726 
7950 
8007 
8049 
8055 
8083 
8103 
8219 
8244 
8247# 
8294 


a SES mm SS a a BS St 
at Se Se Et Gf SS oh Be En SS St 


aD Ean RRC ae ORS 


Boal 
iy 


Ree ARCS RRHRRLALERHA SRE RS 
= 


Ty SS Se OR OSH oo OG BS Ht 
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Appendix C JIS C 6226-1978 Kanji Restored 
in JIS X 0212-1990 


This appendix lists the 28 kanji allographs that disappeared from the JIS X 


0208 series in 1983, and were subsequently restored in JIS X 0212-1990. 


JIS C 6226-1978 JIS X 0208-1983 JIS X 0212-1990 
Traditional Form Simplified Form Restored Traditional Form 
it? 1602 ii =§=1602 i? = =2164 

A 1775 WA 1775 14 4179 

i 1810 KS 1810 MS 7631 

i = 1990 i =91990 Wg §=2258 

HE 2202 hE 2202 (— 1734 

Wt = 2277 HK 2277 Wa = =6452 
2350 WE 2350 Be 5258 

fie = =2420 WR 2420 WR 7659 

8 2577 #8 2577 #8 7679 

jt 2840 PB 2840 jz 2690 

ii = 2911 i 2911 i =6-5255 

= 3053 *F 3053 He 5722 

% 3063 3063 ©6683 

Re «3363 f& 3363 3243 

J 3373 #2 = 3373 J 4587 
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JIS C 6226-1978 JIS X 0208-1983 JIS X 0212-1990 
Traditional Form Simplified Form Restored Traditional Form 
{9 3647 fi] §=3647 i = =93259 

HA 3722 i 3722 f@ 2420 

BA = 3731 HE 3731 Be 7219 

i 3788 Hf 3788 fi 4880 

We 3834 He 48603834 W@W 4112 

#2 3925 3 3925 He 2276 

# 4014 YE 4014 HF 4053 

RE 4016 BE 4016 BE 6687 

WA 4343 fH 4343 i 7204 

Hi 4445 Hil §=94445 Ki 7680 

Be = 4573 3 4573 He ©5639 
«4725 4725 WI -45988 

#8 5825 # =5825 #8 3334 
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